Search CORE

242 research outputs found

DYNAMO-MAS: a multi-agent system for ontology evolution from text

Author: N Aussenac-Gilles
P Cimiano
P Cimiano
S Aubin
S Lemouzy
V Tamma
VI Levenshtein
ZS Harris
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/06/2013
Field of study

International audienceManual ontology development and evolution are complex and time-consuming tasks, even when textual documents are used as knowledge sources in addition to human expertise or existing ontologies. Processing natural language in text produces huge amounts of linguistic data that need to be filtered out and structured. To support both of these tasks, we have developed DYNAMO-MAS, an interactive tool based on an adaptive multi-agent system (adaptive MAS or AMAS) that builds and evolves ontologies from text. DYNA-MO-MAS is a partner system to build ontologies; the ontologist interacts with the system to validate or modify its outputs. This paper presents the architecture of DYNAMO-MAS, its operating principles and its evaluation on three case studies

Crossref

Scientific Publications of the University of Toulouse II Le Mirail

Open Archive Toulouse Archive Ouverte

HAL Descartes

Lexical alignment in triadic communication

Author: Cimiano P.
Foltz A.
Gaspers J.
Stenneken P.
Thiele K.
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2015
Field of study

Lexical alignment refers to the adoption of one's interlocutor's lexical items. Accounts of the mechanisms underlying such lexical alignment differ (among other aspects) in the role assigned to addressee-centered behavior. In this study, we used a triadic communicative situation to test which factors may modulate the extent to which participants' lexical alignment reflects addressee-centered behavior. Pairs of naive participants played a picture matching game and received information about the order in which pictures were to be matched from a voice over headphones. On critical trials, participants did or did not hear a name for the picture to be matched next over headphones. Importantly, when the voice over headphones provided a name, it did not match the name that the interlocutor had previously used to describe the object. Participants overwhelmingly used the word that the voice over headphones provided.This result points to non-addressee-centered behavior and is discussed in terms of disrupting alignment with the interlocutor as well as in terms of establishing alignment with the voice over headphones. In addition, the type of picture (line drawing vs. tangram shape) independently modulated lexical alignment, such that participants showed more lexical alignment to their interlocutor for (more ambiguous) tangram shapes compared to line drawings. Overall, the results point to a rather large role for non-addressee-centered behavior during lexical alignment

Kölner UniversitätsPublikationsServer

Frontiers - Publisher Connector

PubMed Central

Bangor University Research Portal

NL-Graphs: A hybrid approach toward interactively querying semantic data

Author: A. Bangor
D. Kelly
E. Kaufmann
K. Elbedweihy
L. Han
P. Cimiano
R. Bhagdev
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

A variety of query approaches have been proposed by the semantic web community to explore and query semantic data. Each was developed for a specific task and employed its own interaction mechanism; each query mechanism has its own set of advantages and drawbacks. Most semantic web search systems employ only one approach, thus being unable to exploit the benefits of alternative approaches. Motivated by a usability and interactivity perspective, we propose to combine two query approaches (graph-based and natural language) as a hybrid query approach. In this paper, we present NL-Graphs which aims to exploit the strengths of both approaches, while ameliorating their weaknesses. NL-Graphs was conceptualised and developed from observations, and lessons learned, in several evaluations with expert and casual users. The results of evaluating our approach with expert and casual users on a large semantic dataset are very encouraging; both types of users were highly satisfied and could effortlessly use the hybrid approach to formulate and answer queries. Indeed, success rates showed they were able to successfully answer all the evaluation questions

CiteSeerX

Crossref

Sheffield Hallam University Research Archive

Reviewing the problem of learning non-taxonomic relationships of ontologies from text

Author: Alexiev V.
Allen J.
Buitelaar P.
Cimiano P.
Dale R.
IVO SERRA
Mitchell T.
PAULO NOVAIS
ROSARIO GIRARDI
Publication venue: 'World Scientific Pub Co Pte Lt'
Publication date: 01/01/2012
Field of study

Learning Non-Taxonomic Relationships is a sub-field of Ontology Learning that aims at automating the extraction of these relationships from text. This article discusses the problem of Learning Non-Taxonomic Relationships of ontologies and proposes a generic process for approaching it. Some techniques representing the state of the art of this field are discussed along with their advantages and limitations. Finally, a framework for Learning Non- Taxonomic Relationships being developed by the authors is briefly discussed. This framework intends to be a customizable solution to reach good effectiveness in the process of extraction of non-taxonomic relationships according to the characteristics of the corpus.This work is supported by CNPq, CAPES and FAPEMA, research funding agencies of the Brazilian government

Universidade do Minho: RepositoriUM

Crossref

The Computer Science Ontology: A Large-Scale Taxonomy of Research Areas

Author: AE Cano-Basave
B Cherrier
CE Lipscomb
DM Blei
F Osborne
F Osborne
F Osborne
F Osborne
G Wohlgenannt
H Saif
KM Livingston
L Ding
LMA Bettencourt
P Cimiano
P Clough
S Fathalla
SE Middleton
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/10/2018
Field of study

Ontologies of research areas are important tools for characterising, exploring, and analysing the research landscape. Some fields of research are comprehensively described by large-scale taxonomies, e.g., MeSH in Biology and PhySH in Physics. Conversely, current Computer Science taxonomies are coarse-grained and tend to evolve slowly. For instance, the ACM classification scheme contains only about 2K research topics and the last version dates back to 2012. In this paper, we introduce the Computer Science Ontology (CSO), a large-scale, automatically generated ontology of research areas, which includes about 26K topics and 226K semantic relationships. It was created by applying the Klink-2 algorithm on a very large dataset of 16M scientific articles. CSO presents two main advantages over the alternatives: i) it includes a very large number of topics that do not appear in other classifications, and ii) it can be updated automatically by running Klink-2 on recent corpora of publications. CSO powers several tools adopted by the editorial team at Springer Nature and has been used to enable a variety of solutions, such as classifying research publications, detecting research communities, and predicting research trends. To facilitate the uptake of CSO we have developed the CSO Portal, a web application that enables users to download, explore, and provide granular feedback on CSO at different levels. Users can use the portal to rate topics and relationships, suggest missing relationships, and visualise sections of the ontology. The portal will support the publication of and access to regular new releases of CSO, with the aim of providing a comprehensive resource to the various communities engaged with scholarly data

Crossref

Open Research Online (The Open University)

The history of the development of wireless telegraphy and broadcasting in Australia to 1942, with especial reference to the Australian Broadcasting Commission : a political and administrative study

Author: E Cabrio
M Chein
P Becker
P Cimiano
P du Boucher-Ryan
PE Van Der Vet
S Steyskal
V Rodríguez-Doncel
Publication venue: 'The University of Sydney Library'
Publication date: 01/01/1961
Field of study

Licences are a crucial aspect of the information publishing process in the web of (linked) data. Recent work on modeling of policies with semantic web languages (RDF, ODRL) gives the opportunity to formally describe licences and reason upon them. However, choosing the right licence is still challenging. Particularly, understanding the number of features - permissions, prohibitions and obligations - constitute a steep learning process for the data provider, who has to check them individ- ually and compare the licences in order to pick the one that better fits her needs. The objective of the work presented in this paper is to reduce the effort required for licence selection. We argue that an ontology of licences, organized by their relevant features, can help providing support to the user. Developing an ontology with a bottom-up approach based on Formal Concept Analysis, we show how the process of licence selection can be simplified significantly and reduced to answering an average of three/five key questions

CiteSeerX

Crossref

Open Research Online (The Open University)

Sydney eScholarship

Klink-2: integrating multiple web sources to generate semantic topic networks

Author: DM Blei
F Osborne
F Osborne
F Osborne
F Osborne
G Wohlgenannt
P Cimiano
S Peroni
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 30/10/2015
Field of study

The amount of scholarly data available on the web is steadily increasing, enabling different types of analytics which can provide important insights into the research activity. In order to make sense of and explore this large-scale body of knowledge we need an accurate, comprehensive and up-to-date ontology of research topics. Unfortunately, human crafted classifications do not satisfy these criteria, as they evolve too slowly and tend to be too coarse-grained. Current automated methods for generating ontologies of research areas also present a number of limitations, such as: i) they do not consider the rich amount of indirect statistical and semantic relationships, which can help to understand the relation between two topics – e.g., the fact that two research areas are associated with a similar set of venues or technologies; ii) they do not distinguish between different kinds of hierarchical relationships; and iii) they are not able to handle effectively ambiguous topics characterized by a noisy set of relationships. In this paper we present Klink-2, a novel approach which improves on our earlier work on automatic generation of semantic topic networks and addresses the aforementioned limitations by taking advantage of a variety of knowledge sources available on the web. In particular, Klink-2 analyses networks of research entities (including papers, authors, venues, and technologies) to infer three kinds of semantic relationships between topics. It also identifies ambiguous keywords (e.g., “ontology”) and separates them into the appropriate distinct topics – e.g., “ontology/philosophy” vs. “ontology/semantic web”. Our experimental evaluation shows that the ability of Klink-2 to integrate a high number of data sources and to generate topics with accurate contextual meaning yields significant improvements over other algorithms in terms of both precision and recall

CiteSeerX

Crossref

Open Research Online (The Open University)

3LD: towards high quality, industry-ready linguistic Linked Licensed Data

Author: Aguado de Cea G.
Cimiano Philipp
Gómez-Pérez A.
McCrae John P.
Rodríguez Doncel Víctor
Vila Suero Daniel
Publication venue: E.T.S. de Ingenieros Informáticos (UPM)
Publication date: 01/01/2014
Field of study

The application of Linked Data technology to the publication of linguistic data promises to facilitate interoperability of these data and has lead to the emergence of the so called Linguistic Linked Data Cloud (LLD) in which linguistic data is published following the Linked Data principles. Three essential issues need to be addressed for such data to be easily exploitable by language technologies: i) appropriate machine-readable licensing information is needed for each dataset, ii) minimum quality standards for Linguistic Linked Data need to be defined, and iii) appropriate vocabularies for publishing Linguistic Linked Data resources are needed. We propose the notion of Licensed Linguistic Linked Data (3LD) in which different licensing models might co-exist, from totally open to more restrictive licenses through to completely closed datasets

Archivo Digital UPM

Extracting and Modeling Historical Events to Enhance Searching and Browsing of Digital Cultural Heritage Collections

Author: I. Niles
P. Cimiano
R. Shaw
W. Cohen
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011
Field of study

Crossref